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Number partitioning is an NP-complete problem of com- 
binatorial optimization. A statistical mechanics analysis re- 
veals the existence of a phase transition that separates the 
easy from the hard to solve instances and that reflects the 
pseudo-polynomiality of number partitioning. The phase di- 
agram and the value of the typical ground state energy are 
calculated. 
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Computer science has recently discovered the notion 
of phase transition in random combinatorial problems 
and its possible connections with algorithmic complex- 
ity. In such a context, statistical physics may provide 
an interesting perspective for understanding problems in 
theoretical computer science. In this letter we calculate 
the statistical mechanics of one of the core problems in 
theoretical computer science. 

The number partitioning problem is an easily for- 
mulated optimization problem: Given a set A = 
{di, 0,2, . • • , ajv} of positive numbers, find a partition, i.e. 
a subset A' C A, such that the residue 



E = 



E a i- E 



(i) 



ajEA' 



is minimized. A partition with E = is called perfect. 
The decision variant of the number partitioning problem 
is to determine if there is a perfect partition or not. 

Number partitioning is of both theoretical and prac- 
tical importance. It is one of Garey and Johnson's six 
basic NP-complctc problems that lie at the heart of the 
theory of NP-completeness [0 . Among the many practi- 
cal applications one finds multiprocessor scheduling and 
the minimization of VLSI circuit size and delay. 

A partition can be encoded by numbers sj = ±1: Sj — 
1 if aj G A', Sj = — 1 otherwise. The cost function then 
reads 



N 



(2) 



and the minimum partition is equivalent to the ground 
state of the Hamiltonian 



H = E = 



N 

E 



Si Q>i&j Sj . 



(3) 



This is an infinite range Ising spin glass with Mattis-like, 



antifcrromagnetic couplings Jij — 



The thermo- 



dynamics of this model has been investigated by Fu 
and recently by Ferreira and Fontanari || . 

Fu claims that in the random number partitioning 
problem ". . . no phase transition of any kind is found." 
Q. If Fu were right, number partitioning would be 
a notable exception to the observation that many NP- 
complcte problems do have a phase transition, parame- 
terized by a control parameter that separates the easy 
from the hard to solve instances Q. For random, integer 
^ e {0, 1, 2, ... , ^4}, Gent and Walsh || proposed 



k = 



log2j4 
N 



(4) 



as a control parameter: they found numerically that 
one typically has 0(2^) perfect partitions for k < n c , 
whereas for k > n c the number of perfect partitions drops 
to zero. The transition gets sharper with increasing N. 
Finite-size scaling lead Gent and Walsh to k c = 0.96 for 
N — * oo. This result contradicts Fu's claim, but as we 
will see now, this type of phase transition can indeed be 
found in the statistical mechanics of the number parti- 
tioning problem. 

The canonical formalism of statistical mechanics re- 
quires the calculation of the partition function 
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where T is the temperature. We write Z as 



Z = 2 



v / - dy NG(y) 



(5) 



(6) 



with 
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1 N 

G ^ = ^E lncos (^ tan (y))- 



(7) 
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At this point we could use the statistical indepen- 
dence of the a,j and replace the sum by the average of 
lncos(y tan(y)) over a. This is usually done in Mattis- 
like spin glasses ||, but we will proceed without this 
substitution and calculate all thermodynamic quantities 
as functions of {a,j}. 

For large N the integral in Eq. (||) can be evaluated us- 
ing the saddlepoint technique. To find the saddlepoints of 
G(y), we will assume that a can only take on values that 
are integer multiples of a fixed number Aa. For integer 
distributions Aa = 1, and for floatingpoint distributions 
Aa is the smallest number that can be represented with 
the available number of bits. This assumptions leads to 
an infinite number of saddlepoints, missed in p], 
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Uk = arctan(— — k) k = 0, ±1, ±2, . 
Aa 



(8) 



The resulting series of Gaussian integrals can be evalu- 
ated: 



Z = 2 



jv 



E 



k=0,±l,. 



dye 
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= 2 



' 2Aa 2 , Aa 



From that we get the average energy 

E _ Aa coth 2 Aa _ i 
T coth ^ 

and the entropy 



(9) 



(10) 



For k < k c the entropy is extensive even for T = 0. 
According to Eq. (Eot), the corresponding energy is zero, 
hence we expect an exponential number of perfect parti- 
tions, in good agreement with the numerical results 0. 

For k > k c the zero temperature entropy seems to be- 
come negative. This would be wrong because the entropy 
must not be smaller than In 2 for our discrete system. To 
see what is going on here, note that k > n c means 



-JY 



> Ac 



a 2 



(16) 



i.e. essentially Aa = 0(2 ). In this regime the contri- 
butions of S are O(N) for any finite T, 



+ 1 + 0(^0, 



(17) 



hence cannot be neglected. Technically we deal with this 
contribution by introducing an effective "zero" temper- 
ature To below which the system can not be cooled. To 
guarantees that the contribution of S remains O(N). Its 
value can be calculated from the lower bound of S: 

In 2 = N(k c - as) In 2 + 5(— ) 

To 



N(k c - k) ln2 + ln(4^-)- 
Aa 



From that we get 



To = 2Aa 2 N ^- K ^ = J2n ^ . a ] 2 



-N 



(18) 
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S = jVln2--ln( ^ 3 J ) +§(—), 
2 V 2Aa 2 ' y 2T h 

where the thermal contribution reads 



a/^ a \ , , Aa 
S{— ) = In coth — 



Aa coth 2 A? _ i 
~T coth As 



(11) 



(12) 



Note that for finte Aa, S vanishes at zero temperature 
and increases monotonically with T. The entropy can be 
written as 



with 



and 



S = N(k c -k)]h2 + S 



, , In(fiV) 



N2hx2 



(13) 



(14) 



(15) 



Note that n = k + 0(-^-£) for the distribution of the aj's 
considered by Gent and Walsh. 



For k > k c the ground state energy Eq reads 



£o=T = .hvY, A 



y-N 



(19) 



This equation specifies the rigorous result that the me- 
dian value of E is 0{\fN2~ N ) §. 

To check Eq. ([l9]) we consider the continous variant 
of number partitioning, where the at are real numbers, 
uniformely distributed in the interval [0,1). In our for- 
malism this means Aa — > and £^ . fl2 = N/3. We are in 
the k > k c regime and Eq. ( |l9|) becomes 



Tn = 



-itN2 



-N 



1.447 VN 2 



-N 



(20) 



In Fig. (|]), Eq. ( ^p| ) is compared to numerical data. The 



agreement is convincing. The prefactor 



fits much 



better than the prefactor J-^ — 0.2662, reported in M. 

To check whether k(N) is a control parameter with a 
phase transition at k c (N), we did numerical simulations. 
For fixed N and k we calculated the fraction of instances 
that have at least one perfect partition. In accordance 
with Gent and Walsh 101 we find that this fraction is 
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FIG. 1. Average minimum residue of the number parti- 
tioning problem with real numbers < a% < 1 compared to 
the analytical result Eq. ( poj ) (straight line) . Each data point 
is the average over 10 4 random samples. 
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FIG. 2. Phase diagram of the random number partioning 
problem. Nk is essentialy the number of bits to encode the 
input numbers, see Eq. (|l5|). The squares denote the phase 
boundary found numerically. The solid line is given by k c 
from Eq. (Q. For k < k c , the zero temperature entropy 
is extensive and a search algorithm typically finds quickly 
one of the 0(2^) perfect partitions. For k > k c , no perfect 
partitions exist, and the optimization problem has a hard to 
find, unique solution. 



1 for small k and for larger k. The transition from 
1 to is sharp. Fig. (||) shows the numerically found 
transition points for 10 < N < 28 compared to k c (N) 
from Eq. (|l4j). Again the agreement is convincing. Note 
that k c (N —> oo) = 1. The asymptotic estimate 0.96 
given by Gent and Walsh is probably due to the rather- 
small values N < 30 used in their simulations. 

The two phases are very different with respect to 
the computational complexity of the corresponding in- 
stances. For k < k c , a search algorithm is likely to 
find one of the numerous perfect partitions in short time, 
while the unique minimum partition for k > k c requires 
exponential time to be found. This behavior can indeed 
be seen in numerical experiments [|||J§1 ■ Refering to the 
solvability of the typical instance we call the two phases 
"easy" and "hard" . In a numerical investigation, the 
precision of the numbers is fixed and N is varied. For 
high precision and small values of N, we are in the "hard" 
phase: no perfect partition exists, and a search algorithm 
has to explore large parts of the configuration space. If 
one increases N, the search space growth exponentially, 
and so does the running time. On the other hand, in- 
creasing N gets us closer to the phase boundary. Beyond 
this threshold, the number of perfect partitions increases 
exponentially with N. A smart algorithm will try to find 
a perfect partition as early as possible. As can be seen 
from the experiments with the CKK algorithm |^,^,^), 
this may even lead to the effect that the running time 
now decreases with increasing N. As a function of N 
the running time has a sharp maximum at the phase 
boundary: The hardest problems are those close to the 
threshold. A similar behavior has been found in other 
NP-complete problems like, for example, the Satisfiabil- 
ity problem (l0fjl2| . 

For bounded, integer values < a* < A even the 
worst case complexity of number partioning is polyno- 
mial in A and N W. This is no contradiction to the 
NP-completenes since a concise encoding of an instance 
requires N log 2 A bits, and A is not bounded by a polyno- 
mial function of log 2 A. Due to this property the number 
partitioning problem is called pseudo polynomial Q . The 
exponential complexity of number partitioning relies on 
the fact that extremely large (or precise) input numbers 
ai are allowed.^ Pseudo polynomiality applies if the num- 
ber of bits to represent ai is fixed while N increases. This 
means, that k decreases, i.e. we get into the "easy" phase 
for large enough N. Hence the notion "easy" not only 
refers to the typical (as shown here) but also to the worst 
case complexity. This is a notable feature of the number 



1 This distinguishes number partitioning from many other 
NP-complete problems like for example, the travelling sales- 
man problem, which remains NP-hard even if the distances 
are restricted to take on the values 1 and 2. 
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partitioning problem: The statistical mechanics results 
hold beyond the typical case for which they are derived. 

Looking at Eqs. (|l^, |l^) an interpretation of the pa- 
rameter k c suggests itself. Let the iV numbers dj each 
be represented by Nk bits. Now consider the residue E 
bitwise: About half of all partitions will set the most 
significant bit of E to zero. Among those partitions, 
about one half will set the second most significant bit to 
zero, too. Repeating this procedure we can set at most 
Nk c (N) bits to zero until running out of available parti- 
tions. If k < n c , we get a perfect partition before reaching 
this point. The remaining set of available partitions has 
2N(k, c -k) e i emen ts. This explains the zero temperature 
entropy, Eq. (|l3|). For k > n c , the N(k > k c ) least signif- 
icant bits in E can not be fixed by the optimum partition, 
leading to Eq. ([ll]) for the residue. 
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